A Multimedia Retrieval System for Retrieving Chinese Text and Speech Documents
نویسنده
چکیده
Multimedia documents place new requirements on the conventional text retrieval systems. This paper presents a multimedia retrieval system that employs the contentbased strategy to retrieve both text and speech documents. Its input can be a sequence of spoken words which are digitized waveforms or a sequence of characters, and its output is a list of ranked text and/or speech documents. In this system, a new metadata especially designed for both text and speech documents is proposed. The metadata is automatically generated with special consideration of the characteristics of Chinese. The presented approach is very easy to implement and the preliminary tests give very encouraging results.
منابع مشابه
Metadata for Integrating Chinese Text and Speech Documents in a Multi-media Retrieval System
Multimedia documents place new requirements on the conventional text retrieval systems. This paper presents a multimedia retrieval system that employs the content-based strategy to retrieve both text and speech documents. Its input can be a sequence of spoken words which are digitized waveforms or a sequence of characters, and its output is a list of ranked text and/or speech documents. In this...
متن کاملRetrieving of Video Scenes Using Arabic Closed-caption
The increased use of video documents for multimedia-based applications has created a demand for strong video database support, including efficient methods for browsing and retrieving video data. Most solutions to video browsing and retrieval of video data rely on visual information only, ignoring the rich source of the accompanying audio signal and texts. Speech is the significant information t...
متن کاملLarge-vocabulary Chinese Text/speech Information Retrieval Using Mandarin Speech Queries
The network technology and the Internet are creating a completely new information era. It is believed that in the near future numerous of digital libraries and a great variety of multimedia databases, which consist of heterogeneous types of information including text, audio, image, video and so on, will be available worldwide via the Internet. This paper deals with the problem of Chinese text a...
متن کاملSpoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting
Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...
متن کاملRetrieving Video Segments Based on Combined Text, Speech and Image Processing
This paper describes a multimedia, multilingual and multimodal research system (CIMWOS) supporting content-based indexing, archiving, retrieval and ondemand delivery of audiovisual content. There are several projects, aiming at developing advanced technologies and systems to tackle the problems encountered in multimedia archiving and indexing [8], [9], [10]. CIMWOS [1] (Combined IMage and WOrd ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006